Reassessing the Canon: “fixed” phrases in general reference corpora

Author

  • Gill Philip
Abstract

This paper sets forth the argument for revisiting fixed phrases in the light of the knowledge that their fixedness is not necessarily something to be taken for granted. It focuses on the location and analysis of variant forms in general reference corpora. Existing phraseological structures, including collocational frameworks, idiom schemas and semi-prepackaged phrases, are introduced by way of background before a procedure for retrieving non-canonical forms of fixed expressions in general reference corpora is presented. Some implications relating to the study of variant forms are presented, along with suggestions for future research directions.

Similar resources

The quantitative conversion of the component composition of steady-phrases

The article is devoted to one method of converting fixed expressions (proverbs, sayings, aphorisms and similar clichéd sentences) in modern Russian: reducing their component composition (implication). Quantitative changes in fixed phrases are considered as one aspect of the general problem of phraseological variability. T...

Learning Translations of Named-Entity Phrases from Parallel Corpora

We develop a new approach to learning phrase translations from parallel corpora, and show that it performs with very high coverage and accuracy in choosing French translations of English named-entity phrases in a test corpus of software manuals. Analysis of a subset of our results suggests that the method should also perform well on more general phrase translation tasks.

Adapting language models for frequent fixed phrases by emphasizing n-gram subsets

In support of speech-driven question answering, we propose a method to construct N-gram language models for recognizing spoken questions with high accuracy. Question-answering systems receive queries that often consist of two parts: one conveys the query topic and the other is a fixed phrase used in query sentences. A language model constructed by using a target collection of QA, for example, n...

A Continuum-Based Approach for Tightness Analysis of Chinese Semantic Units

Chinese semantic units fall into a continuum of connection tightness, ranging from very tight, non-compositional expressions through tight compositional words and phrases to loose, more or less arbitrary combinations of words. We propose an approach to measuring connection tightness within this continuum, based on the document frequency of segmentation patterns in a reference corpus. A variety of corp...

Recognition of non-domain phrases in automatically extracted lists of terms

In the paper, we address the problem of recognition of non-domain phrases in terminology lists obtained with an automatic term extraction tool. We focus on identification of multi-word phrases that are general terms and discourse function expressions. We tested several methods based on domain corpora comparison and a method based on contexts of phrases identified in a large corpus of general la...

Publication date: 2006